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Abstract —  Most  application  data  units  are  too  large  to  be 
carried  in  a  single  packet  (or  cell)  and  must  be  segmented  for 
network  delivery.  To  an  application,  the  end-to-end  delays  and 
loss  rate  of  its  data  units  are  much  more  relevant  performance 
measures  than  ones  specified  for  individual  packets  (or  cells). 
The  concept  of  a  burst  (or  block)  was  introduced  to  represent 
a  sequence  of  packets  (or  cells)  that  carry  an  application  data 
unit.  In  this  paper,  we  describe  how  a  real-time  variable  bit-rate 
(VBR)  service,  with  quality  of  service  (QoS)  parameters  for  block 
transfer  delay  and  block  loss  rate,  can  be  provided  by  integrating 
concepts  and  delay  guarantee  results  from  our  previous  work  on 
burst  scheduling,  together  with  ideas  from  asynchronous  transfer 
mode  (ATM)  block  transfer.  Two  new  contributions  are  presented 
herein.  First,  we  design  an  admission  control  algorithm  to  provide 
the  following  two  classes  of  service:  bounded-delay  block  transfer 
with  no  loss,  and  bounded-delay  block  transfer  at  a  specified 
block  loss  rate.  Secondly,  we  show  how  to  extend  existing  end-to- 
end  delay  bounds  to  networks  with  hierarchical  link  sharing. 

Index  Terms — Admission  control,  ATM  block  transfer,  burst 
scheduling,  delay  guarantee,  hierarchical  link  sharing. 


I.  Introduction 

IN  ALL  PACKET  switching  networks,  packets  have  a 
maximum  size  (in  number  of  bits).  Most  application  data 
units  are  too  large  to  be  carried  in  a  single  packet  and 
must  be  segmented  for  network  delivery.  To  an  application, 
the  end-to-end  delays  and  loss  rate  of  its  data  units  are 
more  relevant  performance  measures  than  ones  specified  for 
individual  packets.  For  example,  consider  an  application  that 
sends  live  video  over  an  asynchronous  transfer  mode  (ATM) 
network.  Each  picture  is  segmented  into  a  sequence  of  cells 
at  the  sender.  Clearly  the  delays  incurred  to  deliver  whole 
pictures  are  much  more  important  to  the  performance  of  the 
application  than  the  delays  of  individual  cells.  From  this 
observation,  we  introduced  the  concept  of  a  burst  to  represent 
a  sequence  of  packets  that  carry  an  application  data  unit,  and 
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designed  the  class  of  burst  scheduling  networks  to  provide 
delay  guarantees  to  bursts  [16],  [17]. 

The  ATM  block  transfer  (ABT)  capability  being  standard¬ 
ized  by  ITU-T  is  based  upon  a  similar  observation  [14]. 
The  objective  of  ABT  is  to  allow  a  bursty  data  source  to 
dynamically  negotiate  its  bandwidth  reservation  on  the  basis 
of  a  block  of  cells.  Note  that  a  higher  layer  protocol  data  unit, 
fragmented  into  a  number  of  ATM  cells,  is  lost  if  any  one  of 
its  cells  is  lost.  Therefore,  even  a  low  cell  loss  rate  can  cause 
a  significant  loss  rate  for  the  higher  layer  protocol.  As  a  result, 
the  higher  layer  protocol’s  throughput  may  be  much  less  than 
the  protocol  session’s  throughput  measured  in  delivered  cells. 
The  concept  of  a  block  was  introduced  to  represent  a  sequence 
of  cells,  which  may  contain  a  single  data  unit  or  multiple  data 
units  for  the  higher  layer  protocol.  A  block  is  bracketed  by  two 
resource  management  (RM)  cells.  A  leading  RM  cell  requests 
a  reserved  bandwidth  for  the  block,  and  a  trailing  RM  cell 
releases  the  reserved  bandwidth.  Cells  are  handled  in  blocks 
by  a  switch.  In  particular,  a  block  of  cells  is  either  discarded 
or  accepted  entirely.  (This  is  similar  to  the  idea  of  early  packet 
discard  proposed  in  recent  studies  on  IP  over  ATM  [22],  [25].) 

For  the  ABT  service,  the  concept  of  cell  loss  rate  can  be  gen¬ 
eralized  to  block  loss  rate.  Such  a  generalization  is  backward- 
compatible  with  the  existing  ATM  traffic  management  (TM) 
4.0  service  architecture  [10]  since  a  block  is  a  sequence  of 
cells,  with  a  single  cell  being  a  special  case.  Similarly,  the 
concept  of  cell  transfer  delay  for  real-time  variable  bit-rate 
(VBR)  services  can  be  generalized  to  block  transfer  delay 
which,  we  believe,  is  a  more  relevant  performance  measure 
to  many  applications;  for  example,  if  every  picture  in  a  video 
sequence  is  carried  by  a  block  of  cells,  then  the  block  transfer 
delays  are  the  same  as  picture  delays. 

In  this  paper,  we  describe  how  a  real-time  VBR  service, 
called  real-time  block  transfer,  with  quality  of  service  (QoS) 
parameters  for  block  transfer  delay  and  block  loss  rate  can  be 
provided  by  integrating  concepts  and  delay  guarantee  results 
from  our  previous  work  on  burst  scheduling  [16]  and  group 
priority  [17],  together  with  ideas  from  ATM  block  transfer.  In 
particular,  we  present  an  admission  control  algorithm  for  real¬ 
time  block  transfer  that  supports  the  following  two  service 
classes;  bounded-delay  block  transfer  with  no  loss  (i.e.  deter¬ 
ministic  delay  guarantee),  and  bounded-delay  block  transfer  at 
a  specified  block  loss  rate.  The  algorithm  has  been  evaluated 
using  a  simulator  driven  by  MPEG  video  traces. 

It  is  envisioned  that  future  integrated  services  networks  will 
support  not  only  link  sharing  by  multiple  service  classes  (real¬ 
time  service,  best-effort  service,  etc.)  but  also  by  multiple 
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Public  reporting  burden  for  the  collection  of  information  is  estimated  to  average  1  hour  per  response,  including  the  time  for  reviewing  instructions,  searching  existing  data  sources,  gathering  and 
maintaining  the  data  needed,  and  completing  and  reviewing  the  collection  of  information.  Send  comments  regarding  this  burden  estimate  or  any  other  aspect  of  this  collection  of  information, 
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administrative  classes  (different  agencies  and  organizations) 
[9].  Specifically,  packets  (or  cells)  from  sessions  belonging 
to  different  service  classes  and  administrative  classes  interact 
with  one  another  when  they  are  statistically  multiplexed  at 
an  output  link  of  a  switch.  The  link’s  packet  scheduling 
algorithm  plays  an  important  role  in  controlling  such  link 
sharing.  Hierarchical  link  sharing  has  been  proposed  as  a 
solution  [1],  In  this  paper,  we  also  describe  a  general  approach 
for  extending  the  end-to-end  delay  bounds  for  bursts1  from 
networks  in  which  links  are  shared  by  service  classes  only 
[16],  [17]  to  networks  in  which  links  are  hierarchically  shared 
(e.g.,  by  administrative  classes  first,  and  then  by  service  classes 
within  each  administrative  class).  Specifically,  each  logical 
server  in  a  link-sharing  hierarchy  is  modeled  as  a  fluctuation 
constrained  (FC)  server  [18].  Two  theorems  are  presented. 
They  can  be  used  to  derive  block  delay  guarantees  of  FC 
servers  based  on  existing  delay  guarantee  results  of  constant- 
rate  servers.  The  theorems  are  general;  they  are  proved  for  a 
large  class  of  well-known  servers. 

The  balance  of  this  paper  is  organized  as  follows.  In 
Section  II,  the  basic  idea  of  block-based  admission  control 
for  ABT  and  the  concept  of  burst  scheduling  are  described. 
The  end-to-end  delay  bounds  of  burst  scheduling  networks 
are  shown.  In  Section  III,  an  admission  control  algorithm  for 
real-time  block  transfer  services  is  presented.  Experimental 
results  from  using  the  algorithm  are  discussed.  In  Section  IV, 
hierarchical  link  sharing  is  introduced.  Also  described  is  a 
general  approach  for  extending  end-to-end  delay  bounds  to 
networks  with  hierarchical  link  sharing. 

II.  Real-Time  Block  Transfer 

A.  Block-Based  Admission  Control 

In  ABT,2  dynamic  bandwidth  reservation  and  allocation  for 
a  block  of  cells  can  be  carried  out  in  two  ways:  1)  ABT  with 
delayed  transmission  (ABT/DT)  and  2)  ABT  with  immediate 
transmission  (ABT/IT).  For  our  discussion,  the  focus  is  on 
the  latter.  In  ABT/IT,  the  block  is  sent  immediately  after  a 
preceding  RM  cell,  which  contains  a  request  for  a  cell  rate 
and  a  cell  delay  variance.  The  block  proceeds  on  a  switch- 
by-switch  basis,  with  each  switch  either  forwarding  the  block 
with  guaranteed  QoS  for  every  cell  in  the  block  or  discarding 
the  entire  block  if  a  required  resource  such  as  bandwidth  is 
not  available.  In  other  words,  the  switches  perform  admission 
control  on  a  block-by-block  basis. 

With  block-based  admission  control,  cell  losses  are  con¬ 
centrated  over  a  small  number  of  blocks,  and  bandwidth  is 
not  wasted  on  delivery  of  partial  blocks.  Therefore,  ABT  is 
able  to  avoid  the  situation  in  which  cell  losses  spread  over  a 
large  number  of  higher  layer  data  units,  causing  throughput 
degradation  of  such  data  units.  With  block-based  admission 
control,  ABT  is  also  able  to  offer  QoS  measured  in  terms 
of  blocks.  In  particular,  the  concept  of  cell  loss  rate  can  be 
generalized  to  block  loss  rate. 

'End-to-end  delay  bounds  for  blocks  are  obtained  by  specifying  a  burst 
[16],  [17]  to  represent  a  block  of  ATM  cells. 

-A  short  overview  of  ABT  can  be  found  in  [2], 


B.  Burst  Scheduling 

For  burst  scheduling  networks  [16],  we  model  a  flow  as 
a  sequence  of  bursts,  each  of  which  models  a  sequence  of 
packets  that  carries  an  application-specific  data  unit.  A  burst 
corresponds  to  a  block  in  ABT  with  some  minor  differences 
in  detail.  In  particular,  instead  of  two  special  packets  being 
used,  the  first  packet  of  each  burst  is  marked  and  stored  in  it 
is  information  on  the  size  of  the  burst  (in  number  of  packets) 
and  the  average  rate  of  the  burst.  Moreover,  for  efficient  packet 
scheduling  and  delay  jitter  control,  packets  of  each  burst  satisfy 
a  jitter  timing  constraint  [16]. 

The  following  delay  bound  results  are  taken  from  [16]  and 

[17]. 

End-to-End  Delay  Bounds  for  Burst  Scheduling  Networks: 
Consider  a  flow  that  traverses  a  sequence  of  nodes,  indexed 
by  0, 1,  2,  •  •  • ,  K  +  1,  where  node  0  denotes  the  source,  node 
K  +  1  the  destination,  and  the  other  nodes  packet  switches.  If 
the  capacity  of  every  link  on  the  network  path  is  not  exceeded, 
then  the  end-to-end  delay3  of  every  burst  m  of  the  flow, 
denoted  by  Dm,  m  —  1,  2,  •  •  • ,  has  the  following  upper  and 
lower  bounds: 

Dm  <  bn\+  1  +{K  -  1)  max  {  +  Y'  ak  (1) 

Am  l<n<m  [ A„  J  “ 

1  A' 

Dm  >  ( K  —  1)  - - b  Y"'  CYfc  (2) 

tz 1 

where 

bm  size  of  burst  m  (packets); 

A,„  average  rate  of  burst  m  (packets/second); 

ak  small  constant  associated  with  the  link  from  switch  k 
to  k  +  1. 

While  our  design  of  burst  scheduling  networks  consists  of 
specialized  components  such  as  source  and  flow  regulators, 
the  concept  of  burst  scheduling,  i.e.,  modeling  a  flow  as  a 
sequence  of  bursts  and  providing  bounded  delays  to  bursts 
(blocks)  when  link  capacity  is  not  exceeded,  is  quite  general. 
It  can  be  realized  by  many  designs  other  than  ours.  In  the 
balance  of  this  paper  we  refer  to  burst  scheduling  as  a  general 
concept  independent  of  the  design  details. 

C.  Integration  of  ABT  and  Burst  Scheduling 

ABT  is  able  to  minimize  block  losses  through  the  use  of 
block-based  admission  control  and  provide  low  block  loss  rate 
as  QoS.  Burst  scheduling  networks  provide  bounded  block 
transfer  delays  as  QoS  when  link  capacity  is  not  exceeded. 
Integrating  the  concepts  and  results  from  ABT  and  burst 
scheduling,  we  define  a  real-time  VBR  service,  called  real¬ 
time  block  transfer,  that  provides  the  following  two  classes 
of  services:  1)  bounded-delay  block  transfer  with  zero  block 
loss  and  2)  bounded-delay  block  transfer  at  a  specified  block 
loss  rate. 

Admission  control  at  the  flow  level  is  the  key  for  our 
integration.  For  class  1,  peak  rate  reservation  can  be  used  for 

1  Measured  from  the  time  when  the  first  cell  enters  the  network  to  the  time 
when  the  last  cell  leaves  the  network. 
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Fig.  1.  System  model  for  admission  control. 

admission  control  to  ensure  that  the  link  capacity  allocated  to 
this  class  is  not  exceeded  without  discarding  blocks.  For  class 
2,  overbooking  of  the  class’s  allocated  capacity  is  allowed 
at  the  time  of  connection  setup  while  block-based  admission 
control  is  used  to  ensure  that  link  capacity  is  not  exceeded  at 
any  time  by  discarding  blocks  if  necessary.  To  limit  the  block 
loss  rate  to  a  specified  value,  the  extent  of  overbooking  is 
controlled  by  flow  level  admission  control.  In  next  section,  we 
will  describe  in  detail  a  flow  level  admission  control  algorithm 
that  we  have  designed  for  real-time  block  transfer  services.4 

III.  Admission  Control 

Consider  a  flow  whose  source  requests  a  real-time  block 
transfer  service.  We  assume  that  at  the  time  of  connection 
setup,  the  source  supplies  the  network  two  sets  of  flow 
parameters:  1)  QoS  parameters:  block  loss  probability  (BLP) 
and  block  transfer  delay  bound  (BTD)5  and  2)  traffic  param¬ 
eters — sustained  cell  rate  (SCR),  peak  cell  rate  (PCR),  and 
cell  rate  variation  (CRV).  Whether  or  not  to  admit  the  flow 
is  a  decision  made  by  each  switch  in  the  path  of  the  flow. 
Specifically,  each  switch  in  the  path  accepts  the  flow  only  if 
doing  so  will  not  cause  violation  of  QoS  guarantees  to  accepted 
flows;  the  network  admits  the  flow  only  if  all  switches  in  the 
path  accept  the  flow. 

A.  System  Model 

Our  system  model  for  admission  control  by  a  particular 
switch  is  shown  in  Fig.  1.  There  are  M  classes  of  real-time 
block  transfer  service.  They  share  a  link6  with  capacity  C  b/s. 
Each  class  s,  s  —  1,  2,  •  •  • ,  M  is  associated  with  a  weight  <f>s , 
which  is  a  relative  measure  of  the  class’s  allocated  share  of 
the  link  bandwidth.  For  ease  of  presentation  and  without  loss 
of  generality,  we  assume  that  <f>s  —  1.  Therefore,  class 

s  has  a  share  of  the  link  equal  to  rs  —  (f>s  •  C  (b/s).  Each  class 
offers  a  target  block  loss  rate  denoted  by  p.  Without  loss  of 
generality,  we  assume  that 

0  —  pi  <  P2  <  •••  <  Pm  Cl¬ 
in  other  words,  class  1  provides  deterministic  service,  i.e.,  class 
1  service  defined  in  Section  II-C,  and  the  other  classes  provide 

4  The  algorithm  will  be  presented  in  the  context  of  ATM  networks.  Our 
design  should  be  applicable  to  other  types  of  networks. 

3  BTD  will  not  be  considered  further  in  our  design  of  admission  control, 
assuming  that  an  appropriate  burst  scheduling  algorithm  is  used  by  the  network 
to  ensure  block  delay  guarantees. 

6  The  link  could  be  a  logical  one;  see  Section  IV. 


Fig.  2.  A  flow  modeled  by  blocks. 

different  levels  of  statistical  service,  i.e.,  class  2  service  defined 
in  Section  II-C. 

We  also  assume  that  an  appropriate  scheduling  algorithm 
(such  as  WF2Q+  [1])  is  used  by  the  link  to  provide  a  firewall 
between  the  service  classes  and  guarantee  each  class  its  link 
share  (see  Section  IV).  As  a  result,  admission  control  for  each 
class  can  be  performed  independently. 

B.  Preliminary 

In  addition  to  the  system  model,  there  are  a  couple  of  factors 
(or  implicit  assumptions)  that  are  key  to  understanding  our 
flow  level  admission  control  algorithm.  They  are  discussed 
next. 

1 )  Traffic  Model:  In  what  follows,  a  flow  is  modeled  as 
a  sequence  of  blocks,  each  of  which  models  an  application 
data  unit.  The  first  and  last  packet  of  each  block  is  marked, 
and  the  first  packet  carries  the  bandwidth  requirement  (i.e.,  the 
average  rate)  of  the  block.  Fig.  2  illustrates  a  flow  modeled  by 
blocks.  (Am  denotes  the  bandwidth  requirement  of  block  to.) 
Note  that  unlike  our  burst  flow  specification  [16],  interpacket 
spacing  within  a  block  is  not  specified  in  this  traffic  model. 
This  is  because  such  spacing,  while  having  a  direct  impact 
on  block  delays  and  block  delay  jitter,  does  not  significantly 
affect  admission  control. 

2)  Algorithm  Specification  of  Block-Based  Admission  Con¬ 
trol:  Block-based  admission  control  by  a  service  class  is 
formally  specified  below.  Let  r  be  the  link  capacity  allocated  to 
the  service  class.  The  variable  A  is  used  to  store  the  aggregate 
rate  allocated  by  the  service  class,  and  is  initialized  to  0. 

•  Upon  arrival  of  first  packet  of  block  to 

Block -Admission -Control  (to) 

1  if  (A  +  Am  >  r) 

2  then  discard  block  to; 

3  else  admit  block  to; 

4  A  A  +  A,„; 

•  upon  departure  of  last  packet  of  block  to 

A:—  A  —  Am. 

The  above  algorithm  has  extremely  low  processing  cost. 
Furthermore,  it  is  performed  only  once  per  block.  Note  that 
the  average  interblock  arrival  time  is  usually  much  larger  than 
the  average  interpacket  arrival  time.  (This  is  especially  true 
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in  an  ATM  network.)  Therefore,  the  algorithm  is  suitable  for  as  n  — >  oo,  where  N( 0, 1)  is  the  standard  normal  distribution, 
high  speed  networks.  Therefore,  we  can  approximate  (Pr(Tg(f)  >  0)  <  ps)  by 


C.  Derivation  of  Flow  Level  Admission  Control  Conditions 

For  flows  requesting  the  deterministic  service,  admission 
control  should  be  based  on  peak  rate  reservation  so  that  the 
class’s  link  share  would  never  be  exceeded  and  thus  no  block 
would  be  discarded.7 

Next  consider  a  statistical  class  s.  Flows  that  request  this 
class  of  service  can  tolerate  some  block  losses.  Therefore,  the 
admission  control  for  them  can  be  more  aggressive  to  increase 
utilization  of  the  class’s  bandwidth  by  taking  advantage  of 
statistical  multiplexing. 

Specifically  with  block-based  admission  control,  the  objec¬ 
tive  of  the  flow  level  admission  control  becomes  very  specific: 
to  allow  as  much  overbooking  as  possible  while  bounding  the 
probability  that  the  class’s  unallocated  capacity  is  not  sufficient 
for  a  newly  arrived  block — denoted  as  the  overflow  probability 
of  the  class — by  ps.  The  overflow  probability  and  statistical 
multiplexing  gains  are  closely  related.  In  particular,  if  two 
classes  have  the  same  overflow  probability,  the  utilization  is 
higher,  because  of  statistical  multiplexing  gains,  for  the  one 
that  has  a  larger  capacity  and  is  shared  by  more  flows.  In 
what  follows,  we  derive  a  set  of  conditions  that  are  sufficient 
to  limit  the  overflow  probability  of  class  s  at  approximately 
ps.  The  goal  is  to  find,  among  these  conditions,  one  that  is 
both  accurate  and  easy  to  check. 

Assume  that  the  service  of  the  class  is  currently  shared  by 
a  set  of  n  flows  (indexed  by  1,2,  •  •  •  ,n).  Let  BLPi,  SCRi, 
PCRi,  and  CRVi  be  the  QoS  and  traffic  parameters*  that  are 
supplied  to  the  switch  by  flow  i.  At  any  time,  with  block- 
based  admission  control,  at  most  one  block  from  each  flow 
has  its  reserved  rate  allocated.  Denote  Aft)  the  reserved  cell 
rate  for  the  block  of  flow  i  that  is  either  allocated  a  rate 
or  being  processed  by  block-based  admission  control  at  time 
t.  (  A,  (t)  —  0  if  there  is  no  such  block.)  In  our  analysis  below, 
Aj(f)’s  are  considered  as  independent  random  variables.  Define 

n 

Ys(t)  =  '^2\i(t)-rs.  (3) 

?=i 

Consider  a  block  that  is  being  processed  by  block-based 
admission  control  at  time  t.  It  will  be  discarded  if  ( Ys(t )  >  0). 
Therefore,  the  goal  is  to  find  conditions  sufficient  for 

Pr(T;(f)>o)  <ps.  (4) 


From  a  generalized  version  of  the  central  limit  theorem  [19] 
(included  in  the  Appendix  as  Theorem  3),  we  have 


Y(t)  -  E\Y,(t)] 
y/varfl's^)] 


iV(0,l) 


(5) 


7  If  the  switch  has  a  priori  knowledge  of  the  traffic  characteristics  of  flows 
in  the  class,  peak  rate  reservation  may  not  be  necessary.  But  it  is  unrealistic 
with  today’s  networks. 

8  Note  that  BLPj  is  the  probability  of  block  losses  through  one  switch  (i.e. 
a  single  hop)  required  for  flow  i.  We  discuss  how  to  distribute  an  end-to-end 
block  loss  requirement  to  individual  switches  in  a  different  report  [28]. 


pr  i>#WL  <Ps 

\  yvaxp;^)]/ 

where  X  ~  iV(0, 1). 

The  following  condition  is  sufficient  for  (6): 

-£K(*>1  >z. 


«s) 


\J  varfl'g  (£)] 
tat  sat 

-£[V(*)] 


(7) 


where  Zs  is  the  constant  that  satisfies  (Pr(V  >  Z„)  —  ps ). 
Define 


Z  = 


v/varfr^)] 


We  have 


i= 1 

n 

var[T;(Q]  =  ^  vai'[A;(£)]. 


(8) 

(9) 

(10) 


?=1 


Therefore 


-  dw)] 


Z  = 


i= 1 


(ID 


■  ]T  vai'[A;(Q] 

\  i= 1 


Combining  (7)  and  (9)— ( 11),  we  have  the  following  sufficient 
condition  for  (4): 


i-L 


(12) 


We  refer  to  the  value  of  (Zs/Z)  as  the  statistical  multiplexing 
intensity  (SMI)  of  class  5.  It  should  never  exceed  the  threshold 
of  one  to  bound  the  block  rate  of  the  class  below  ps.  In 
practice,  it  is  difficult  to  obtain  the  exact  value  of  Z.  However, 
Z  can  be  estimated  as  follows: 


-  ^SCR, 


Z  = 


?=1 


(13) 


n£crv‘ 


In  summary,  the  following  admission  control  condition  can 
be  used  for  class  s: 

Zs 

-4-  <  1.  (14) 

z 

Note  that  the  source  of  flow  i  may  not  have  a  good  estimate 
of  CRVi  at  the  time  of  connection  setup.  In  such  a  case,  CRVi 
is  upper  bounded  by  SCRi*  (PCRi— SCRi),  which  can  be  used 
as  a  pessimistic  estimate.9  (See  Theorem  4  in  the  Appendix.) 

9  Moreover,  online  measurement  of  SCR  and  CRV  can  be  carried  out  for 
admitted  flows  to  improve  the  admission  control  accuracy  of  the  algorithm. 
We  are  currently  investigating  such  techniques.  Note  that  there  have  been 
several  recent  studies  on  measurement-based  admission  control,  e.g.,  [12]. 
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Fig.  3.  Simulated  network. 


D.  Algorithm  Specification 

The  following  admission  control  algorithm  follows  from  the 
analysis  in  the  previous  section.  The  variables  Bs  and  Vs  are 
used  to  store,  respectively,  the  available  bandwidth  and  the 
total  cell  rate  variance  of  class  s.  Initially,  Bs  —  rs  and 
Vs  —  0.  We  assume  that  if  a  source  does  not  have  a  good 
estimate  of  CRV  at  the  time  of  connection  setup,  it  will  let  the 
network  know  by  setting  CRV  =  —  1. 


1 

2 

3 

4 

5 

6 

7 

8 


[•'low  Admission  Control  (BLP,  PCR,  SCR,  CRV) 

s  find-dass(BLP); 
if  (s  =  1) 

then  if(i?i  -  PCR  >  0) 

then  accept  the  flow; 

Bx  :=  Si  -  PCR; 
else  reject  the  flow; 
else  if  (CRV  =  -1) 

then  CRV  :=  SCR* 


(PCR -SCR); 

9  SMI  :=  Z5  *  (sqrt(Vs  +  CRV)/ 

(Ss  -  SCR)); 


10 

if  ( SMI  <  1) 

11 

then  accept  the  flow; 

12 

Bs  :=  Bs  -  SCR; 

13 

V,  :=  V,  +  CRV; 

14 

else  reject  the  flow. 

The  algorithm  is  straightforward  and  simple  to  implement. 
First,  BLP  is  used  to  find  the  service  class  desired  by  the 
source.  Assume  the  class  found  is  5.  If  5  is  1,  the  source 
requests  the  deterministic  service.  Therefore,  the  admission 
decision  is  based  upon  PCR  of  the  flow  and  the  bandwidth 
currently  available  for  class  1 .  Otherwise,  the  source  requests 
a  statistical  service.  The  admission  decision  is  then  based  on 
the  projected  SMI  value  of  class  s. 

E.  Experimental  Results 

We  have  evaluated  the  admission  control  algorithm  by 
performing  a  set  of  simulation  experiments.  The  simulation 
configuration  is  shown  in  Fig.  3.  The  nodes  labeled  by  VS 
denote  groups  of  three  video  sources,  and  VD  their  destination. 
Each  video  source  generates  53-byte  packets  from  a  trace  file 
obtained  from  an  MPEG  video  sequence  and  packets  for  each 
MPEG  frame  (or  picture)  are  modeled  as  a  block.  A  profile 
of  the  MPEG  sequences  used  in  our  experiments  is  given  in 


Fig.  4.  Channel  utilization  improvement. 


TABLE  I 

Profile  of  MPEG  Sequences 


MPEG 

sequence 

encoding 

pattern 

picture 

rate 

(Mb/s) 

SCR 

CRV 

min 

ave 

max 

Airwolf 

(3,6) 

0.14 

0.89 

3.31 

0.89 

0.512 

Energizer 

(3.6) 

0.17 

0.76 

2.34 

0.76 

0.385 

SimpsonsI 

(3.6) 

0.14 

0.92 

2.60 

0.92 

0.482 

Hardboiled 

(0.1) 

0.23 

1.07 

2.67 

1.07 

0.136 

Terminator 

(3.6) 

0.14 

1.15 

3.86 

1.15 

1.194 

Canyon 

(3.6) 

0.08 

0.28 

0.70 

0.28 

0.046 

Jamie 

(3.6) 

0.04 

0.71 

2.87 

0.71 

0.515 

TheWalB 

(1,2) 

0.03 

1.07 

3.39 

1.07 

0.281 

Reds 

Nightmare 

(10,30) 

0.89 

0.75 

3.62 

0.75 

0.458 

UnderSiege 

(3.6) 

0.17 

0.59 

2.02 

0.59 

0.227 

Table  I.  Their  durations  vary  from  10  s  to  several  minutes. 
Each  sequence  is  used  by  10  or  11  video  sources. 

The  admission  control  algorithm  is  implemented  for  channel 
LI.  Each  of  the  video  sources  makes  a  reservation  with  the 
network,  and  starts  sending  packets  only  after  the  reservation 
is  successful.  In  our  experiments,  all  sources  requested  the 
same  class  of  statistical  service  with  a  target  block  loss  rate  of 
p.  At  the  channel,  packets  are  scheduled  based  on  their  virtual 
clock  values  [31],  The  capacity  of  LI  as  well  as  the  value  of 
p  were  varied  in  different  experiments.  Each  experiment  was 
run  for  10  s  of  simulated  time. 

1 )  Channel  Utilization:  In  Fig.  4,  the  channel  utilization  as 
a  function  of  the  target  block  (picture)  loss  rate  is  plotted.  The 
result  shows  that  the  channel  is  used  much  more  efficiently 
with  a  statistical  service  than  a  deterministic  service  (with 
zero  loss  rate).  The  price  to  pay  is  a  small  nonzero  picture 
loss  probability.  The  utilization  increase  is  more  significant 
with  a  higher  channel  capacity,  from  below  30%  to  above 
70%  in  the  case  where  the  capacity  of  LI  is  56  Mb/s.  This 
is  because  the  improvement  is  due  to  statistical  multiplexing 
gains,  which  are  larger  with  more  flows  sharing  the  channel. 
(Up  to  50  flows  were  accepted  when  the  capacity  was  set  to 
56  Mb/s  while  the  number  was  six  for  the  8-Mb/s  case.)  Note 
that  for  the  same  channel  capacity,  the  magnitude  of  utilization 
gain  levels  off  as  p  increases.  (A  similar  observation  was  first 
discussed  in  [20].)  Therefore,  our  admission  control  algorithm 
can  be  used  to  provide  low  block  loss  rate  while  achieving 
high  channel  utilization. 
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Fig.  5.  Actual  versus  expected  loss  rate. 


TABLE  II 

Actual  Loss  Rate 


Ll  = 

8  Mb/s 

Ll  = 

29  Mb/s 

Ll  = 

56  Mb/s 

Expected 

Actual 

Expected 

Actual 

Expected 

Actual 

0% 

0% 

0% 

0% 

0% 

0% 

0.10% 

0.28% 

0.10% 

0.22% 

0.10% 

0.16% 

0.25% 

0.76% 

0.25% 

0.35% 

0.25% 

0.25% 

0.51% 

1.52% 

0.51% 

0.44% 

0.51% 

0.27% 

0.75% 

1.49% 

0.75% 

0.65% 

0.75% 

0.35% 

0.99% 

1.77% 

0.99% 

0.72% 

0.99% 

0.35% 

2 )  Actual  Loss  Rate:  For  the  channel  utilization  gain  to 
be  meaningful,  the  actual  block  (picture)  loss  rates  in  the 
experiments  must  be  close  to  their  respective  target  values. 
In  Fig.  5,  we  compare  the  actual  picture  loss  rate  in  each 
experiment,  averaged  over  five  simulation  runs  using  different 
random  seeds,  with  the  target  value.  From  the  figure,  it  can 
be  concluded  that  our  admission  control  algorithm  predicts  the 
actual  loss  rate  well  when  a  large  number  of  flows  share  the 
channel.  (Around  30  flows  were  admitted  when  the  channel 
capacity  of  LI  was  set  to  56  Mb/s.)  This  agrees  with  our 
analysis;  the  larger  the  number  of  flows  sharing  the  channel, 
the  better  the  approximation  based  on  the  central  limit  theorem. 
Note  that  the  solid  45°  line  represents  perfect  prediction  by 
the  central  limit  theorem.  (The  exact  loss  rates  are  listed  in 
Table  II  for  reference.) 

F.  Related  Work: 

In  this  section,  we  review  some  related  work  in  the  area  of 
admission  control  for  statistical  service. 

Clark  et  al.  [4]  proposed  predicted  service  as  the  statistical 
service  component  of  their  integrated  services  model  for  the 
Internet.  Later,  Jamin  et  al.  [15]  designed  a  measurement- 
based  admission  algorithm  for  predictive  service  and  reported 
very  good  performance  in  network  utilization.  However,  pre¬ 
dictive  service  by  design  requires  that  applications  adapt  to 
large  changes  in  end-to-end  packet  delays.  It  is  not  appropriate 
for  applications  such  as  remote  teaching  and  video  on  demand. 
Guerin  et  al.  [13]  and  others  (e.g.,  [6])  proposed  to  use  effec¬ 
tive  bandwidth  for  admission  control.  While  the  admission 
condition  is  simple  using  effective  bandwidth,  sophisticated 


Fig.  6.  Link- sharing  tree  model. 

traffic  modeling,  policing,  and  monitoring  are  required.  In 
comparison,  our  block  traffic  model  is  simpler.10  Chong  et 
al.  [3]  and  others  proposed  to  use  neural  nets  and  fuzzy  logic 
combined  with  online  traffic  measurements  for  “intelligent” 
admission  control  and  reported  very  positive  experimental  re¬ 
sults.  However,  the  performance  of  such  an  algorithm  depends 
heavily  upon  how  the  neural  net  was  trained  and  the  data  used 
for  the  training. 

Note  that  none  of  the  above  proposals  considered  the 
performance  of  application  data  units,  which  we  believe  is 
more  important  to  applications  than  packet  performance. 

IV.  Hierarchical  Link  Sharing 

Existing  delay  bounds  for  blocks  (bursts),  e.g.,  the  ones 
presented  in  Section  II-B,  were  derived  for  networks  with  a 
flat  one-level  link-sharing  structure.  Specifically,  each  link  in 
the  networks  can  be  modeled  as  a  single  server.  In  this  section, 
we  discuss  how  to  generalize  the  delay  bounds  to  networks  in 
which  links  are  hierarchically  shared. 

A.  Directed  Tree  Model 

First,  we  describe  a  directed  tree  model,  largely  borrowed 
from  [1],  for  representation  of  a  link-sharing  hierarchy  (see 
Fig.  6).  The  root  node,  denoted  by  R.  corresponds  to  the 
physical  link,  each  leaf  node  corresponds  to  a  flow  with  a 
queue  of  packets,  and  each  nonleaf  node  (except  for  the  root) 
corresponds  to  a  link-sharing  entity,  e.g.,  an  administrative 
agency,  a  traffic  type,  or  a  service  class.  A  nonleaf  node  n  is 
called  backlogged  if  at  least  one  flow  in  its  leaf  descendent 
node  set ,  denoted  by  leaf(n),  is  backlogged.  Conceptually, 
node  n  is  a  logical  server  for  its  descendents.  The  amount 
of  work  done  by  n  in  the  time  interval  [Zi .  /_>]  is  defined  to 
be  Wn(ti,t2)  =  E/e ieaf(n)  Wf(h,t2),  where  W f{t1}t2)  is 

10  We  know  that  it  is  not  trivial  to  police  preventively  the  variance  of  a 
flow.  However  such  policing  is  not  required  in  our  algorithm.  Specifically 
when  processing  a  new  flow  request,  our  algorithm  does  not  need  an  accurate 
value  of  the  flow’s  rate  variance  because  of  the  existence  of  an  upper  bound 
on  variance  (see  Theorem  4  in  the  Appendix).  Once  admitted,  the  flow’s 
variance  and  average  rate  will  be  monitored  and  policed  if  necessary  based 
on  measurements.  Traffic  monitoring  and  policing  are  integral  parts  of  our 
framework.  Because  of  the  complexity  of  the  topic  and  space  limit,  we  have 
decided  to  treat  it  separately  in  a  future  report. 


36 


IEEE/ ACM  TRANSACTIONS  ON  NETWORKING,  VOL.  6,  NO.  1,  FEBRUARY  1998 


Fig.  7.  An  example  link- sharing  tree. 

the  amount  of  flow  /  traffic  served  in  the  interval  [Q.  G].  To 
achieve  link  sharing,  each  node  rn  is  assigned  a  weight  <f>m  >  0, 
which  is  a  relative  measure  of  the  link  share  desired  by  entity 
to.  For  ease  of  presentation  and  without  loss  of  generality,  we 
assume  that 


continuously  backlogged,  the  following  holds 

W„ (t i ,  t2)  >  r„  -{t2-ti)  (17) 

where  rn  —  </>,,  •  C ,  that  is,  server  n  is  guaranteed  its  link 
share  whenever  it  has  work  to  do. 

It  is  straightforward  to  show  that  if  every  logical  server  is  a 
fluid  model  fair  queueing  (FFQ)  server,11  then  ideal  link  shar¬ 
ing  will  be  achieved  for  every  logical  server.  Unfortunately, 
FFQ  is  not  feasible.  Therefore,  scheduling  algorithms  that  are 
good  approximations  of  FFQ  should  be  used  at  the  logical 
servers. 

B.  Prior  Work 

In  [1],  Bennett  and  Zhang  proposed  a  new  approximation 
of  FFQ,  named  worst-case  fair  weighted  fair  queueing  plus 
(WF2Q+),  for  providing  tight  delay  bounds  to  flows  under 
a  link-sharing  hierarchy.  They  defined  a  metric  for  a  logical 
server  called  normalized  bit  worst-case  fair  index  (NB-WFI). 

Definition  2:  A  logical  server  node  n  is  said  to  guarantee 
an  NB-WFI  of  cvm  „  for  its  child  to  if,  during  any  time  interval 
[t i ,  t2\  in  which  to  is  continuously  backlogged,  the  following 
holds: 


<t>R  =  1 

^  ^  ^  0n 

mC  children(n) 


V  non-leaf  node  n. 


(15) 

(16) 


Notation:  Let  H(jn)  be  the  number  of  ancestors  that  node 
to  has.  Let  ph(m)  be  the  ancestor  node  of  to  that  is  h 
levels  higher  than  to  in  the  tree.  Clearly  pl(jn)  —  parent (m) 
and  j>n< '"'(rn)  —  R.  For  ease  of  presentation,  we  also  set 
p°(m)  =  TO. 

Hierarchical  link  sharing  can  have  a  big  impact  on  the 
performance  of  flows  that  share  the  link.  Specifically,  packets 
of  a  flow  under  the  link-sharing  hierarchy  are  scheduled  jointly 
by  all  ancestor  nodes  of  the  flow  before  being  served  by  the 
link  [1].  We  illustrate  this  point  with  the  following  example. 

Example  1:  Consider  the  simple  link-sharing  hierarchy 
shown  in  Fig.  7.  Assume  that  all  logical  server  nodes  are 
currently  backlogged.  For  the  link  to  serve  one  of  the  available 
bit  rate  (ABR)  flows  from  agency  A1  next,  both  of  the 
following  conditions  have  to  be  met:  1)  it  is  determined 
by  the  scheduler  of  the  root  logical  server  that  it  is  agency 
Al’s  turn  to  receive  service  and  2)  it  is  determined  by  the 
scheduler  of  the  A1  logical  server  that  it  is  traffic  class  ABR’s 
turn  to  receive  service.  Therefore,  if  the  root  logical  server 
used  a  scheduling  algorithm  that  could  not  guarantee  agency 
A1  a  link  share  of  0.6,  then  the  performance  of  Al’s  VBR 
flows  would  be  affected  negatively. 

For  hierarchical  link  sharing  to  be  useful,  appropriate  sched¬ 
uling  algorithms  must  be  used  at  the  logical  servers  to  mini¬ 
mize  the  negative  impact  on  flow  performance.  We  next  define 
ideal  link  sharing  to  set  a  performance  target  for  designing 
such  scheduling  algorithms.  Assume  that  the  link  bandwidth 
is  C  (b/s). 

Definition  1:  We  say  that  ideal  link  sharing  is  achieved  for 
logical  server  n  if,  for  any  time  interval  [Q .  G]  in  which  n  is 


>  ^Wn(t1,t2)  -  am,n.  (18) 

Yn 

They  then  showed  that  delays  of  flow  /  packets  are  bounded 
by 


°j_  +  aph  (/)  (19) 

rf  &)  V‘(/) 

where  ay,  ^  is  the  NB-WFI  guaranteed  to  node  ph(f )  by 
node  ph+1(f)  if  the  flow  is  constrained  by  a  leaky  bucket 
with  pf  <  rj.  Based  on  this  result,  Bennett  and 
Zhang  claimed  that  small  NB-WFI  values  for  the  logical 
servers  are  necessary  for  providing  tight  delay  bounds  to  the 
flows.  They  also  showed  that,  among  all  proposed  packet  fair 
queueing  servers,  a  WF2Q+  server  offers  the  smallest  NB- 
WFI,  namely,  the  length  of  one  packet  when  a  fixed  packet 
length  is  used  by  all  flows. 

The  problem  of  scheduling  packets  for  flows  under  a  link¬ 
sharing  hierarchy  was  also  studied  by  Goyal  el  al.  in  [11], 
where  an  algorithm  called  start-time  fair  queueing  (SFQ)  was 
proposed.  SFQ  is  very  similar  to  the  well-known  self-clocked 
fair  queueing  (SCFQ)  [5],  except  that  in  SFQ  the  virtual 
start  tag  is  used  instead  of  the  virtual  finish  tag  of  a  packet 
as  the  priority  value  of  the  packet.  Goyal  et  al.  observed 
that  the  intermediate  logical  servers  no  longer  have  constant 
service  rates  for  their  children.  Therefore,  they  analyzed  the 
performance  of  SFQ  in  the  context  of  FC  servers  first  defined 
by  Lee  [18]. 

Definition  3:  A  server  is  said  to  be  FC  with  parameters 
(C .  b ),  or  simply  a  FC(C.  b)  server,12  if,  for  all  intervals 
[Q ,  t2\  in  a  busy  period  of  the  server,  the  work  done  by  the 

"Also  called  general  processor  sharing  (GPS)  server  in  the  literature  [21]. 

1  ’  A  constant  rate  server  is  also  FC  with  <5  =  0. 
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server,  denoted  by  fF(£i,£2),  satisfies 


of  L  [1],  it  can  be  shown  that  Sn  is  at  most 


(20) 

In  addition,  they  showed  that  the  service  received  by  a  flow 
from  an  FC  SFQ  server  is  also  FC.  Thus,  if  the  logical  servers 
(called  virtual  servers  in  [11])  are  all  SFQ  servers,  the  packet 
delay  bound  for  a  flow  under  a  link-sharing  hierarchy  can  be 
recursively  computed.  The  exact  bound  is  not  given  in  [11], 


C.  A  General  Approach 

While  the  authors  cited  above  have  made  important  con¬ 
tributions  to  the  design  of  WF2Q+  and  SFQ,  their  approach 
is  not  the  best.  Specifically,  too  much  emphasis  was  put  on 
one  good  fair  queueing  algorithm  for  both  link  sharing  and 
packet  scheduling.  For  future  networks,  heterogeneous  packet 
scheduling  algorithms  may  be  required  at  different  parts  of  the 
link-sharing  hierarchy  to  achieve  multiple  design  goals.  For 
example,  if  implementation  complexity  is  of  primary  concern, 
scheduling  algorithms  like  deficit  round  robin  [24]  may  be 
more  desirable  than  others.  Also,  there  are  a  large  number  of 
performance  results  in  the  literature  for  one-level  servers.  It  is 
not  obvious  from  [1]  and  [11]  how  these  results,  e.g.,  those 
for  weighted  fair  queueing  (WFQ)  servers  [7]  or  even  first- 
in  first-out  (FIFO)  servers,  can  be  extended  to  networks  with 
hierarchical  link  sharing. 

Next  we  describe  an  approach  in  which  link  sharing  and 
packet  scheduling  concerns  are  separated.  In  particular,  a  link¬ 
sharing  hierarchy  is  considered  an  extension  of  a  one-level 
constant  rate  server.  Consider  a  particular  flow  /  under  the 
link-sharing  hierarchy.  Even  with  hierarchical  link  sharing, 
the  flow  is  in  essence  scheduled  by  a  one -level  server,  its 
parent  node,  but  with  a  variable  service  rate.  The  impact  of 
all  nonparent  ancestors  of  /  is  indirectly  accounted  for  by 
the  service  rate  fluctuation  of  the  parent  node.  Therefore,  the 
analysis  of  flow  /  performance  can  be  carried  out  in  two  steps: 

1)  characterization  of  the  service  rate  fluctuation  of  the 
parent  server; 

2)  extension  of  the  performance  results  for  one-level 
servers  to  account  for  service  rate  fluctuations  (char¬ 
acterized  in  the  previous  step). 

For  step  1,  we  characterize  the  parent  server  n  —  parent(f) 
as  an  FC  server.  That  is,  there  exists  a  constant  8n  >  0  such 
that,  for  any  time  interval  [£ i ,  £2]  in  which  n  is  backlogged 
(busy)  throughout,  the  work  (service)  done  by  n  satisfies 


Wn(ti,t2)  >  r„  •  (£2  -  h)  -  Sn.  (21) 

Note  that  the  smaller  8„  is,  the  less  the  service  rate  fluctuation 
is  for  n.  If  all  ancestors  of  n  are  FFQ  servers,  8„  —  0. 
Therefore,  good  approximations  of  FFQ  should  be  used  for 
ancestors  of  n  to  ensure  a  small  value  for  8n.  Let  us  look  at 
two  examples. 

Example  2:  Assume  that  all  ancestors  of  n  are  WF2Q+ 
servers  and  all  packets  have  a  fixed  length  of  L  b.  From 
Definition  1  and  the  fact  that  WF2Q+  guarantees  an  NB-WFI 


E 

h—0 


I  L. 


1  ph  (n) 


(22) 


Let  us  make  the  conservative  assumption  that 
^ p>!  ( 11 )  I* p>!  1  ( n )  —  O.o,  h  —  0, 1 ,  •  •  • ,  H (ti)  1.  (The 
ratio  is  usually  much  smaller  in  reality.)  Then  it  is  easy  to 
show  that  S„  <  2 L. 

Example  3:  Assume  that  all  ancestors  of  n  are  SFQ  servers 
and  all  packets  have  a  fixed  length  of  L  b.  Applying  Theorem 
2  of  [11],  it  can  be  shown  that  S„  is  at  most 


h= 0  v,+1(") 


(Q(h)  +  1) 


L 


(23) 


where  Q(h)  is  the  number  of  branches  that  ph+1(n )  has. 

Let  us  assume  that  rpip,p/rpi,+ipp  —  0.5  and  Q(h)  — 
Q,h  —  0, 1,  •  •  • ,  H(n)  —  1.  Then  S„  could  be  as  large  as 
(Q+2)L.  Therefore,  in  general,  SFQ  causes  larger  service  rate 
fluctuations  than  WF2Q+.  On  the  other  hand,  SFQ  appears 
to  have  a  smaller  implementation  cost. 

For  step  2,  we  next  show  how  to  extend  delay  guarantee 
results  from  constant  rate  servers  to  FC  servers. 

Definition  4:  For  an  FC(C,  8)  server,  its  corresponding 
constant  rate  server  is  defined  to  be  identical  to  the  FC  server 
except  that  it  has  a  constant  service  rate  of  C  b/s. 

Consider  a  class  of  work  conserving  servers,  called  the 
priority  class ,  which  can  be  described  in  general  as  follows.  A 
priority  value  is  computed  and  assigned  to  every  packet  upon 
its  arrival,  and  queued  packets  are  scheduled  for  service  in  the 
order  of  increasing  priority  values.  Ties  between  packets  are 
broken  arbitrarily.  Also,  within  a  particular  flow,  the  priority 
of  each  packet  is  nondecreasing  in  packet  arrival  time. 

Notation:  Consider  an  arbitrary  sequence  of  packet  arrivals 
to  a  priority  server.  (A  packet  arrival  is  represented  by  a  tuple 
consisting  of  the  packet  arrival  time  and  the  packet  size.)  We 
use  the  following  notation: 

A{j>)  arrival  time  of  packet  p  in  the  arrival  sequence; 

s(p)  size  of  packet  p  (in  bits); 

P(p)  priority  value  assigned  to  packet  p\ 

L(j>)  departure  time  of  packet  p. 

In  what  follows,  we  focus  on  a  subclass  called  service 
independent  priority  (SIP)  servers,  defined  below. 

Definition  5:  An  FC(C,8 )  priority  server  belongs  to  the 
SIP  subclass  if  for  every  packet  p  of  an  arrival  sequence  to 
the  server.  Dip)  depends  exclusively  upon  the  value  of  C  and 
packet  arrivals  up  to  and  including  p. 

It  is  easy  to  see  that  FC  versions  of  virtual  clock  [31]  and 
delay-earliest  due  date  (delay-EDD)  [8]  servers  belong  to  the 
SIP  class.  FC  versions  of  servers  that  approximate  a  hypo¬ 
thetical  constant  rate  FFQ  server  (such  as  WFQ  [7],  packet 
generalized  processor  sharing  (PGPS)  [21],  and  WF2Q+  [1]) 
can  be  defined  so  that  their  packet  priority  values  depend  upon 
virtual  times  in  the  constant  rate  FFQ  server  and  are  thus  SIP 
servers  as  well.  FC  FIFO  servers  also  belong  to  the  SIP  class 
since  they  in  effect  use  the  arrival  time  of  a  packet  as  the 
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packet  priority.  (Lee  obtained  some  delay  bound  results  for 
FC  FIFO  servers  with  leaky  bucket  constrained  sources  [18].) 

In  particular,  we  consider  two  types  of  SIP  servers: 
preemptive-resume  with  no  overhead  and  nonpreemptive.  For 
an  SIP  server  that  is  preemptive -resume  with  no  overhead, 
it  will  immediately  stop  the  service  of  a  packet  and  serve  a 
newly  arrived  packet  if  the  new  arrival  has  a  smaller  priority 
value.  But  no  work  will  be  lost  because  of  the  preemption,  i.e., 
when  resuming  service  for  the  preempted  packet,  the  server 
will  start  from  where  it  stopped.  For  an  SIP  server  that  is 
nonpreemptive,  the  service  of  a  packet  cannot  be  preempted 
once  it  is  started. 

Next  we  present  two  theorems  on  extension  of  delay  guar¬ 
antees  for  SIP  servers.13  (Their  proofs  are  in  the  Appendix.) 
They  deal  with  preemptive  and  nonpreemptive  SIP  servers, 
respectively.  We  say  that  a  delay  guarantee  has  the  firewall 
property  if  the  guarantees  to  packets  of  a  flow  are  independent 
of  how  other  flows  behave. 

Theorem  1:  Consider  an  FC(C,8)  SIP  server  that  is 
preemptive-resume  with  no  overhead.  If  its  corresponding 
constant  rate  server  guarantees  a  departure  deadline  of 
P(j>)  +  ff  to  every  packet  p  of  an  arrival  sequence,  where 
[}  is  a  constant,  and  the  guarantee  has  the  firewall  property 
and  is  independent  of  the  priority  tie-breaking  method,  then 
it  guarantees  to  every  packet  p  a  departure  deadline  of 

P(p)  +  P  +  (S/C). 

Corollary  1:  An  FC(C,  8)  FIFO  server  guarantees  a  delay 
bound  of  p  +  (8/C)  to  every  packet  p  of  an  arrival  sequence 
if  its  corresponding  constant  rate  server  guarantees  a  delay 
bound  of  ff  to  every  packet  p. 

For  nonpreemptive  servers,  the  service  may  be  out  of-  order 
sometimes.  It  happens  when  a  newly  arrived  packet  has  a 
priority  value  smaller  than  that  of  the  packet  being  served, 
but  preemption  is  not  allowed. 

Definition  6:  Assume  that  a  nonpreemptive  server  guaran¬ 
tees  to  packet  p  a  deadline  of  D(p).  The  guarantee  is  said  to 
account  for  out-of-order  service  if,  with  preemption,  the  server 
can  guarantee  p  a  deadline  of  D{jj)  —  (smax/ C)  where  smax 
is  the  maximum  packet  size. 

Theorem  2:  Consider  a  nonpreemptive  FC(C,8)  SIP 
server.  If  its  corresponding  constant  rate  server  guarantees 
a  departure  deadline  of  P(p)  +  p  to  every  packet  p  of  an 
arrival  sequence,  and  the  guarantee  has  the  firewall  property, 
accounts  for  out-of-order  service,  and  is  independent  of  the 
priority  tie-breaking  method,  then  it  guarantees  to  every  packet 
p  a  departure  deadline  of  P(jp  +  P  +  (8/C). 

Note  that  both  theorems  and  Corollary  1  are  quite  general. 
They  do  not  depend  on  specific  admission  control  conditions 
or  source  control  mechanisms.  In  contrast,  most  of  Lee’s 
analyses  on  FIFO  FC  servers  [18]  were  done  for  leaky  bucket 
constrained  sources. 

Example  4:  This  example  illustrates  that  the  extended  dead¬ 
line  is  as  tight  as  the  original  deadline.  Consider  a  virtual  clock 
server  [31]  that  is  FC(1,  4).  It  is  currently  shared  by  two  flows; 
flow  A  has  a  reserved  rate  of  1/6  and  flow  B  has  a  reserved  rate 
of  5/6.  Assume  that  both  flows  use  a  fixed  packet  length  of  1. 

13  Note  that  the  servers  need  to  perform  some  form  of  admission  control  to 
provide  meaningful  delay  guarantees. 


The  first  packet  of  flow  A  arrived  at  time  0  and  the  first  packet 
from  flow  B,  denoted  as  p,  arrived  at  time  1/5.  A  constant  rate 
virtual  clock  server  with  a  unit  service  rate  guarantees  to  p 
a  departure  deadline  of  P(p)  +  1  =  1.4  +  1  =  2.4  [27],  p 
actually  will  depart  from  the  constant  rate  server  at  2,  which 
is  close  to  2.4.  From  Theorem  2,  the  FC  server  guarantees  a 
deadline  of  2.4  +  4  —  6.4  to  p.  The  FC  server  could  be  idle 
from  time  1  to  5  and,  in  that  case,  p  would  not  depart  from 
the  FC  server  until  time  6,  which  is  close  to  6.4. 

The  burst  (block)  delay  bounds  presented  in  Section  II-B 
were  derived  from  a  delay  guarantee  of  virtual  clock  servers 
that  has  the  firewall  property,  accounts  for  out-of-order  service, 
and  is  independent  of  priority  tie-breaking  method  [27].  Using 
Theorem  2,  it  is  straightforward  to  extend  them  to  networks 
with  hierarchical  link  sharing. 

Corollary  2:  Consider  the  end-to-end  burst  delay  bounds 
presented  in  Section  II-B.  If  at  each  switch  k  the  packets  of 
the  flow  are  served  by  a  FCPCf..  bjf)  logical  server  instead 
of  a  constant  rate  link  with  capacity  6) . ,  then  the  end-to-end 
delay  of  burst  m  of  the  flow  is  bounded  as  follows: 
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V.  Concluding  Remarks 

Although  our  admission  control  algorithm  makes  decisions 
entirely  based  on  available  bandwidth,  our  work  is  very 
different  from  those  assuming  a  bufferless  model.  Specifically 
for  real-time  block  services,  packets  are  buffered  at  a  channel 
while  waiting  for  their  turn  to  be  transmitted.  With  block-based 
admission  control,  the  amount  of  buffer  space  required  for  a 
flow  is  bounded,  and  can  be  precisely  computed  based  on  the 
maximum  block  size,  the  interpacket  spacing  within  a  block, 
and  the  flow’s  delay  bound  for  the  channel.  In  this  paper,  we 
have  assumed  that  sufficient  buffer  is  allocated  to  each  flow 
so  that  no  packet  loss  occurs  due  to  buffer  overflow. 

In  addition  to  admission  control  and  link  sharing,  several 
important  components  of  real-time  block  transfer  services  are 
still  under  developement.  Recently  we  have  investigated  how 
to  incorporated  active  loss  management  techniques  into  our 
framework  so  that  losses  can  be  distributed  more  evenly 
among  flows  subscribing  to  the  same  class  of  statistical  service. 
The  results  are  reported  in  [28],  which  also  contains  more 
discussions  of  related  work.  We  will  also  look  into  using  other 
statistical  tools  (such  as  the  large  deviation  theory  [23])  to 
improve  the  admission  control  accuracy. 


Appendix 

Proof  of  Theorem  1:  Define  N(p)  as  the  set  of  packets  in 
the  arrival  sequence  whose  priority  values  are  less  than  or 
equal  to  that  of  packet  p  and  Wp(t  1C2)  as  the  total  work  (in 
bits)  done  by  the  FC  server  for  packets  in  N(jp)  in  the  time 
interval 
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We  will  carry  out  a  proof  by  contradiction.  Specifically,  we 
assume  that  there  exists  a  packet  p  in  the  arrival  sequence 
such  that 

Lip)>P{p)  +  ft+^.  (26) 

Then  we  will  show 

wJ 0, P{p)  +P+^)>  E  (27) 

'  '  ?€JV(p) 

which  contradicts  (26). 

We  use  the  superscript  c  to  label  a  term  defined  for  the 
constant  rate  server.  By  definition  of  SIP  servers,  p  has  the 
same  priority  value  P(p)  in  both  server  systems.  Thus,  we 
have 

N\p )  =  Nip).  (28) 

From  (26),  there  exists  a  time  period  of  which  [A(ji).  Pij>)  + 
ft  +  {S/C)]  is  a  subinterval  and  in  which  the  FC  server  is 
continuously  busy  with  Nip)  packets.  Letp*  be  the  packet  that 
started  this  time  period.  (Note  that  p*  may  be  p  itself.)  In  other 
words,  there  is  no  other  Nip)  packet  in  the  FC  system  when 
p*  arrived.  Consider  the  set  of  packets  in  N(p)  that  arrived 
prior  to  Aip*)  in  the  FC  system  (same  as  the  packets  in  Nc{p) 
that  arrived  prior  to  Aip*)  in  the  constant  rate  system).  Since 
all  of  them  have  been  served  in  the  FC  system  by  Aip*)  (but 
not  necessarily  in  the  constant  rate  system),  we  have 

Wp{0,  A(p*))  >  Wp{0,  A(p*)).  (29) 

Since  the  FC  server  is  busy  exclusively  with  packets  in  Nip) 
throughout  the  interval  [Aip*),  Pip)  +  ft  +  {S/C)],  we  have 

Wp^A(p*fP(p)  +  p+^j 


>  c- 

(pW  +  P+^-Aip^-S 

(30) 

—  c  ■ 

{ Pip)  +  ft-A{p *)) 

(31) 

=  W' 

ftAip*),Pip)  +  ft). 

(32) 

Combining  (29)  with  (32),  we  have 

WP(o,Pip)  +  p  +  ^j 

>  w;{0,  A{p*))  +  Wc{A{p*),  Pip)  +  ft).  (33) 

Consider  the  constant  rate  server  and  a  modification  to  the 
arrival  sequence  as  follows.  Those  packets  in  Nc(p)  that  arrive 
at  or  after  A{p)  +  ( s(p)/C )  in  the  original  sequence  will  have 
their  arrival  times  moved  forward  in  the  modified  sequence 
such  that  they  all  arrive  in  the  time  interval  ( A(p ),  A(p)  + 
{s(p)/C))  and  the  original  order  of  arrivals  for  each  flow  is 
preserved.  Note  that  the  priority  value  of  p  as  well  as  the 
guaranteed  deadline  of  p  are  unaffected  by  the  modification. 
Moreover,  since  the  priority  value  of  each  packet  within  a 
flow  is  nondecreasing  in  packet  arrival  time  (by  definition  of  a 
priority  class  server),  Nc(p)  is  unaffected  by  the  modification 
as  well. 

The  deadline  guarantee  of  P{p)  +  ft  by  the  constant  rate 
server  still  holds  for  the  modified  packet  arrival  sequence 


because  of  the  firewall  property.  Moreover,  the  guarantee  is 
independent  of  the  tie-breaking  method.  Therefore,  it  would 
hold  even  if  p  finished  service  last  among  all  Nc(p)  packets. 
Also  with  the  modified  arrival  sequence,  the  most  amount 
of  work  that  the  constant  rate  server  can  do  for  Nc{p) 
packets  in  the  time  interval  [0, Pip)  +  ft)]  is  Wp{ 0,  Aip*))  + 
Wc{A(p*),  Pip)  +  ft).  Therefore,  the  following  must  hold: 

W;{0,A(p*))  +  Wc{A(p*ftP(p)  +  ft)>  E  *(<?)•  (34) 

?SArc(p) 

Combining  (33)  and  (34),  we  have 

WP(o,Pip)+ft+p)>  E  *(<?)•  (35) 

'  '  q<ENpp) 

Because  of  (28),  Nc(p)  can  be  substituted  by  Nip)  in  (35). 
Therefore,  (27)  holds. 

Proof  of  Corollary  1:  For  FIFO  servers,  Pip)  —  Aip)  for 
all  p.  Therefore,  a  delay  bound  guarantee  of  ft  is  equivalent  to 
a  deadline  guarantee  of  Pip)  +  ft.  The  proof  for  the  deadline 
guarantee  generalization  is  identical  to  the  one  for  Theorem 
1  except  for  a  simpler  reasoning  for  (34):  While  a  deadline 
guarantee  by  a  FIFO  server  does  not  have  the  firewall  property, 
there  is  no  need  for  the  condition  because  all  packets  in  Nc{p) 
arrive  no  later  than  A{p),  therefore,  we  can  reason  for  (34) 
with  the  original  arrival  sequence. 

Proof  of  Theorem  2:  Again  we  carry  out  a  proof  by  contra¬ 
diction.  Specifically,  we  assume  that  for  the  nonpreemptive  FC 
server  there  exists  a  packet  p  in  the  arrival  sequence  such  that 

L(p)>P(p)  +  ft+^.  (36) 

Then  we  will  show 

WPUpip)  +  ft+p)>  E  *(<?)  (37) 

'  '  ?SAr(p| 

which  contradicts  (36). 

In  this  proof,  we  also  consider  a  third  SIP  server,  which  is 
identical  to  the  constant  rate  server  except  that  it  is  preemptive- 
resume  with  no  overhead.  We  use  the  superscripts  c  and  pie 
to  label  terms  for,  respectively,  the  nonpreemptive  and  pre¬ 
emptive  constant  rate  server  systems.  Note  that  by  definition 
of  SIP  servers,  p  has  the  same  priority  value  Pip)  for  all  three 
server  systems,  i.e., 

Npveip)  =  Nc{p)  =  Nip).  (38) 

It  is  given  that 

Lcip)<Pip)+ft  (39) 

and  the  guarantee  in  (39)  accounts  for  out-of-order  service. 
Therefore,  we  have 

L^ip)<Pip)+ft-S-PPA.  (40) 

Note  that  the  above  guarantee  also  has  the  firewall  property. 


40 


IEEE/ ACM  TRANSACTIONS  ON  NETWORKING,  VOL.  6,  NO.  1,  FEBRUARY  1998 


Similar  to  the  proof  for  Theorem  1,  we  can  show  that  there  Theorem  4:  For  any  random  variable  X  that  can  take  value 
exists  a  packet  p*  6  N(p)  such  that  between  0  and  Xmax,  the  following  holds: 

WP{ 0,  Aip*))  >  W^{0,  Aip*))  (41)  var[X]  <  (h  -  1  )E[Xf  (52) 

wTa(j>*),  P(p)+f3+^  )  >  C  •  {P(p)+f3-A(p*))-sm^.  where  h  is  the  peak-to-average  ratio  (A"max/£[X]). 

V  /  Proof:  By  definition,  the  variance  of  X,  denoted  by 

(42) 

var[A'],  satisfies 


The  extra  term  smax  in  (42)  is  due  to  the  fact  that  a  part  of  the 
work  done  by  the  nonpreemptive  FC  server  in  [A{p*  ).  P{p)  + 
[3  +  1 5/(7]  may  be  for  out-of-order  service  of  a  packet  not  in 

N(p )• 

Similar  to  the  proof  for  Theorem  1,  we  have 

W}r(0,  Aip*))  +  W^(A(p*f  Pip )  +  (3  -  ^p) 

>  e  (43> 

qei\rpi°(p) 

Moreover 

W^(A{p*),P(p)  +  f3-^) 

=  C  ■  (p(p)  +  (3  -  ^  -  Aip*))  (44) 

=  C  ■  {Pip)  +  [i-  A{p*))  -  w  (45) 


var[A]  =  J  x2  dF{x)  -  £[X]2-  (53) 

Since  x2  <  Amax  •  x,  we  have 

var[A]  <  J (Amax  •  x)  dF{x)  -  E[ A]2  (54) 

=  xmax.  J  x  dF(x)  -  E[X]2  (55) 

=  XmsKE[X\-E[X]2  (56) 

=  (/i-l)£[X]2.  (57) 

□ 
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Combining  (43)  and  (45),  we  have 

W}r  { 0,  Aip*))  +  C  •  ( Pip )  +  13-  Aip*))  -  w 

>  (46) 

q^NP^(p) 

From  (41),  (42),  and  (46),  we  have 
Wp(o,Pip)  +  /3+^j 

=  Wp{ 0,  Aip*))  +  WP  (Aip*),  Pip)  +  (3+^j  (47) 

>  wr(°.  a(pj*^  +  a  +  P  ~  a(p *))  -  W 

(48) 

>  E  (49) 

qe  ArPre(p) 

Because  of  (38),  Npl'e{p)  can  be  substituted  by  N (p)  in  (49). 
Therefore,  (37)  holds.  □ 

Theorem  3  ([19]):  Assume  that  _Yi,A2,  •••  are  indepen¬ 
dent  random  variables  centered  at  expectations  with  distribu¬ 
tion  functions  (d.f.’s)  1\ .  F>.  •  •  •  and  variance  a2,  a2. 

Let  S„  —  X / ,.  be  their  consecutive  sums  with  variance 

s2  -  of ,  n  =  1,  2,  •  •  • .  Then 

—  ->  iV(0, 1)  (50) 

sn 

if  and  only  if  for  all  e  >  0 

tn(e)  -  ~2  E  /  x2dFk  — >  0  (51) 

□ 
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